Voice Timbre Control Based on Perceived Age in Singing Voice Conversion

نویسندگان

Kazuhiro Kobayashi

Tomoki Toda

Hironori Doi

Tomoyasu Nakano

Masataka Goto

Graham Neubig

Sakriani Sakti

Satoshi Nakamura

چکیده

The perceived age of a singing voice is the age of the singer as perceived by the listener, and is one of the notable characteristics that determines perceptions of a song. In this paper, we describe an investigation of acoustic features that have an effect on the perceived age, and a novel voice timbre control technique based on the perceived age for singing voice conversion (SVC). Singers can sing expressively by controlling prosody and voice timbre, but the varieties of voices that singers can produce are limited by physical constraints. Previous work has attempted to overcome this limitation through the use of statistical voice conversion. This technique makes it possible to convert singing voice timbre of an arbitrary source singer into those of an arbitrary target singer. However, it is still difficult to intuitively control singing voice characteristics by manipulating parameters corresponding to specific physical traits, such as gender and age. In this paper, we first perform an investigation of the factors that play a part in the listener’s perception of the singer’s age at first. Then, we applied a multiple-regression Gaussian mixture models (MR-GMM) to SVC for the purpose of controlling voice timbre based on the perceived age and we propose SVC based on the modified MR-GMM for manipulating the perceived age while maintaining singer’s individuality. The experimental results show that 1) the perceived age of singing voices corresponds relatively well to the actual age of the singer, 2) prosodic features have a larger effect on the perceived age than spectral features, 3) the individuality of a singer is influenced more heavily by segmental features than prosodic features 4) the proposed voice timbre control method makes it possible to change the singer’s perceived age while not having an adverse effect on the perceived individuality. key words: singing voice, voice conversion, perceived age, spectral and prosodic features, subjective evaluations

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical approach to perceived age control of singing voice

The perceived age of a singing voice is the age of the singer as perceived by the listener, and is one of the notable characteristics that determines perceptions of a song. Singers can sing expressively by controlling prosody and voice timbre, but the varieties of voice timbre that singers can produce are limited by physical constraints. Previous work has attempted to overcome the limitation th...

متن کامل

Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion

As one of the techniques enabling individual singers to produce the varieties of voice timbre beyond their own physical constraints, a statistical voice timbre control technique based on the perceived age has been developed. In this technique, the perceived age of a singing voice, which is the age of the singer as perceived by the listener, is used as one of the intuitively understandable measu...

متن کامل

An investigation of acoustic features for singing voice conversion based on perceptual age

In this paper, we investigate the acoustic features that can be modified to control the perceptual age of a singing voice. Singers can sing expressively by controlling prosody and vocal timbre, but the varieties of voices that singers can produce are limited by physical constraints. Previous work has attempted to overcome this limitation through the use of statistical voice conversion. This tec...

متن کامل

Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion

In this paper, we evaluate our proposed singing voice conversion method from various perspectives. To enable singers to freely control their voice timbre of singing voice, we have proposed a singing voice conversion method based on many-tomany eigenvoice conversion (EVC) that enables to convert the voice timbre of an arbitrary source singer into that of another arbitrary target singer using a p...

متن کامل

Applying voice conversion to concatenative singing-voice synthesis

This work address the application of Voice Conversion to singing-voice. The GMM-based approach was applied to VOCALOID, a concatenative singing synthesizer, to perform singer timbre conversion. The conversion framework was applied to full-quality singing databases, achieving a satisfactory conversion effect on the synthesized utterances. We report in this paper the results of our experimentatio...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

IEICE Transactions

دوره 97-D شماره

صفحات -

تاریخ انتشار 2014

Voice Timbre Control Based on Perceived Age in Singing Voice Conversion

نویسندگان

چکیده

منابع مشابه

Statistical approach to perceived age control of singing voice

Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion

An investigation of acoustic features for singing voice conversion based on perceptual age

Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion

Applying voice conversion to concatenative singing-voice synthesis

عنوان ژورنال:

اشتراک گذاری